Search CORE

157 research outputs found

Multi-Edge Gene Set Networks Reveal Novel Insights into Global Relationships between Biological Themes

Author: Marto Jarrod
Parikh Jignesh R.
Xia Yu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

Curated gene sets from databases such as KEGG Pathway and Gene Ontology are often used to systematically organize lists of genes or proteins derived from high-throughput data. However, the information content inherent to some relationships between the interrogated gene sets, such as pathway crosstalk, is often underutilized. A gene set network, where nodes representing individual gene sets such as KEGG pathways are connected to indicate a functional dependency, is well suited to visualize and analyze global gene set relationships. Here we introduce a novel gene set network construction algorithm that integrates gene lists derived from high-throughput experiments with curated gene sets to construct co-enrichment gene set networks. Along with previously described co-membership and linkage algorithms, we apply the co-enrichment algorithm to eight gene set collections to construct integrated multi-evidence gene set networks with multiple edge types connecting gene sets. We demonstrate the utility of approach through examples of novel gene set networks such as the chromosome map co-differential expression gene set network. A total of twenty-four gene set networks are exposed via a web tool called MetaNet, where context-specific multi-edge gene set networks are constructed from enriched gene sets within user-defined gene lists. MetaNet is freely available at http://blaispathways.dfci.harvard.edu/metanet/

CiteSeerX

Public Library of Science (PLOS)

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

FigShare

Recommended from our members

Discovering Causal Signaling Pathways Through Gene-Expression Patterns

Author: Blüthgen Nils
Klinger Bertram
Marto Jarrod
Parikh Jignesh R.
Xia Yu
Publication venue: 'Oxford University Press (OUP)'
Publication date: 11/12/2012
Field of study

High-throughput gene-expression studies result in lists of differentially expressed genes. Most current meta-analyses of these gene lists include searching for significant membership of the translated proteins in various signaling pathways. However, such membership enrichment algorithms do not provide insight into which pathways caused the genes to be differentially expressed in the first place. Here, we present an intuitive approach for discovering upstream signaling pathways responsible for regulating these differentially expressed genes. We identify consistently regulated signature genes specific for signal transduction pathways from a panel of single-pathway perturbation experiments. An algorithm that detects overrepresentation of these signature genes in a gene group of interest is used to infer the signaling pathway responsible for regulation. We expose our novel resource and algorithm through a web server called SPEED: Signaling Pathway Enrichment using Experimental Data sets. SPEED can be freely accessed at http://speed.sys-bio.net/

Harvard University - DASH

Multiplierz: An Extensible API Based Desktop Environment for Proteomics Data Analysis

Author: Askenazi Manor
Blank Nathaniel C.
Cashorali Tanya
Ficarro Scott B.
Marto Jarrod A.
Parikh Jignesh R.
Webber James T.
Zhang Yi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

BACKGROUND. Efficient analysis of results from mass spectrometry-based proteomics experiments requires access to disparate data types, including native mass spectrometry files, output from algorithms that assign peptide sequence to MS/MS spectra, and annotation for proteins and pathways from various database sources. Moreover, proteomics technologies and experimental methods are not yet standardized; hence a high degree of flexibility is necessary for efficient support of high- and low-throughput data analytic tasks. Development of a desktop environment that is sufficiently robust for deployment in data analytic pipelines, and simultaneously supports customization for programmers and non-programmers alike, has proven to be a significant challenge. RESULTS. We describe multiplierz, a flexible and open-source desktop environment for comprehensive proteomics data analysis. We use this framework to expose a prototype version of our recently proposed common API (mzAPI) designed for direct access to proprietary mass spectrometry files. In addition to routine data analytic tasks, multiplierz supports generation of information rich, portable spreadsheet-based reports. Moreover, multiplierz is designed around a "zero infrastructure" philosophy, meaning that it can be deployed by end users with little or no system administration support. Finally, access to multiplierz functionality is provided via high-level Python scripts, resulting in a fully extensible data analytic environment for rapid development of custom algorithms and deployment of high-throughput data pipelines. CONCLUSION. Collectively, mzAPI and multiplierz facilitate a wide range of data analysis tasks, spanning technology development to biological annotation, for mass spectrometry-based proteomics research.Dana-Farber Cancer Institute; National Human Genome Research Institute (P50HG004233); National Science Foundation Integrative Graduate Education and Research Traineeship grant (DGE-0654108

Crossref

Boston University Institutional Repository (OpenBU)

Springer - Publisher Connector

PubMed Central

Recommended from our members

Proteomic Analysis Reveals CACN-1 Is a Component of the Spliceosome in Caenorhabditis elegans

Author: Adelmant Guillaume
Cecchetelli Alyssa D.
Cram Erin J.
Doherty Michael F.
Marto Jarrod A.
Publication venue: 'Genetics Society of America'
Publication date: 08/09/2014
Field of study

Cell migration is essential for embryonic development and tissue formation in all animals. cacn-1 is a conserved gene of unknown molecular function identified in a genome-wide screen for genes that regulate distal tip cell migration in the nematode worm Caenorhabditis elegans. In this study we take a proteomics approach to understand CACN-1 function. To isolate CACN-1−interacting proteins, we used an in vivo tandem-affinity purification strategy. Tandem-affinity purification−tagged CACN-1 complexes were isolated from C. elegans lysate, analyzed by mass spectrometry, and characterized bioinformatically. Results suggest significant interaction of CACN-1 with the C. elegans spliceosome. All of the identified interactors were screened for distal tip cell migration phenotypes using RNAi. Depletion of many of these factors led to distal tip cell migration defects, particularly a failure to stop migrating, a phenotype commonly seen in cacn-1 deficient animals. The results of this screen identify eight novel regulators of cell migration and suggest CACN-1 may participate in a protein network dedicated to high-fidelity gonad development. The composition of proteins comprising the CACN-1 network suggests that this critical developmental module may exert its influence through alternative splicing or other post-transcriptional gene regulation

Harvard University - DASH

Recommended from our members

Genome-scale Proteome Quantification by DEEP SEQ Mass Spectrometry

Author: Adelmant Guillaume
Ficarro Scott B.
Jiang Wenyu
Lu Yu
Luckey C. John
Marto Jarrod A.
Zhou Feng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/03/2014
Field of study

Advances in chemistry and massively parallel detection underlie DNA sequencing platforms that are poised for application in personalized medicine. In stark contrast, systematic generation of protein-level data lags well-behind genomics in virtually every aspect: depth of coverage, throughput, ease of sample preparation, and experimental time. Here, to bridge this gap, we develop an approach based on simple detergent lysis and single-enzyme digest, extreme, orthogonal separation of peptides, and true nanoflow LC-MS/MS that provides high peak capacity and ionization efficiency. This automated, deep efficient peptide sequencing and quantification (DEEP SEQ) mass spectrometry platform provides genome-scale proteome coverage equivalent to RNA-seq ribosomal profiling and accurate quantification for multiplexed isotope labels. In a model of the embryonic to epiblast transition in murine stem cells, we unambiguously quantify 11,352 gene products that span 70% of Swiss-Prot and capture protein regulation across the full detectable range of high-throughput gene expression and protein translation

Harvard University - DASH

Concordant and opposite roles of DNA-PK and the "facilitator of chromatin transcription" (FACT) in DNA repair, apoptosis and necrosis after cisplatin

Author: Adelmant Guillaume
Calkins Anne S
Iglehart Dirk J
Lazaro Jean-Bernard
Marto Jarrod
Sand-Dejmek Janna
Sobhian Bijan
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Recommended from our members

multiplierz: An Extensible API Based Desktop Environment for Proteomics Data Analysis

Author: Askenazi Manor
Blank Nathaniel C
Cashorali Tanya
Ficarro Scott
Marto Jarrod
Parikh Jignesh R
Webber James T
Zhang Yi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/02/2011
Field of study

Background: Efficient analysis of results from mass spectrometry-based proteomics experiments requires access to disparate data types, including native mass spectrometry files, output from algorithms that assign peptide sequence to MS/MS spectra, and annotation for proteins and pathways from various database sources. Moreover, proteomics technologies and experimental methods are not yet standardized; hence a high degree of flexibility is necessary for efficient support of high- and low-throughput data analytic tasks. Development of a desktop environment that is sufficiently robust for deployment in data analytic pipelines, and simultaneously supports customization for programmers and non-programmers alike, has proven to be a significant challenge. Results: We describe multiplierz, a flexible and open-source desktop environment for comprehensive proteomics data analysis. We use this framework to expose a prototype version of our recently proposed common API (mzAPI) designed for direct access to proprietary mass spectrometry files. In addition to routine data analytic tasks, multiplierz supports generation of information rich, portable spreadsheet-based reports. Moreover, multiplierz is designed around a "zero infrastructure" philosophy, meaning that it can be deployed by end users with little or no system administration support. Finally, access to multiplierz functionality is provided via high-level Python scripts, resulting in a fully extensible data analytic environment for rapid development of custom algorithms and deployment of high-throughput data pipelines. Conclusion: Collectively, mzAPI and multiplierz facilitate a wide range of data analysis tasks, spanning technology development to biological annotation, for mass spectrometry-based proteomics research

Harvard University - DASH

Recommended from our members

An RS Motif within the Epstein-Barr Virus BLRF2 Tegument Protein Is Phosphorylated by SRPK2 and Is Important for Viral Replication

Author: Adelmant Guillaume
Calderwood Michael A
Deng Hongyu
Duarte Melissa
Hill David E.
Johannsen Eric
Marto Jarrod
Ohashi Makoto
Roecklein-Canfield Jennifer
Wang Lili
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 13/05/2013
Field of study

Epstein-Barr virus (EBV) is a gammaherpesvirus that causes infectious mononucleosis, B cell lymphomas, and nasopharyngeal carcinoma. Many of the genes required for EBV virion morphogenesis are found in all herpesviruses, but some are specific to gammaherpesviruses. One of these gamma-specific genes, BLRF2, encodes a tegument protein that has been shown to be essential for replication in other gammaherpesviruses. In this study, we identify BLRF2 interacting proteins using binary and co-complex protein assays. Serine/Arginine-rich Protein Kinase 2 (SRPK2) was identified by both assays and was further shown to phosphorylate an RS motif in the BLRF2 C-terminus. Mutation of this RS motif (S148A+S150A) abrogated the ability of BLRF2 to support replication of a murine gammaherpesvirus 68 genome lacking the BLRF2 homolog (ORF52). We conclude that the BLRF2 RS motif is phosphorylated by SRPK2 and is important for viral replication

Harvard University - DASH

Directory of Open Access Journals

FigShare